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<210> 1 

<211> 4027 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> CDS 

<222> (57) . . (3452) 

<400> 1 

caggcagcgt ggtcctgctg cgcacgtggg aagccctggc cccggccacc cccgcg atg 59 

Met 
1 

ccg cgc get ccc cgc tgc cga gec gtg cgc tec ctg ctg cgc age cac 107 
Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser His 
5 10 15 

tac cgc gag gtg ctg ccg ctg gec acg ttc gtg egg cgc ctg ggg ccc 155 
Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly Pro 
20 25 30 

cag ggc tgg egg ctg gtg cag cgc ggg gac ccg gcg get ttc cgc gcg 203 
Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg Ala 
35 40 45 

ctg gtg gec cag tgc ctg gtg tgc gtg ccc tgg gac gca egg ccg ccc 251 
Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro Pro 
50 55 60 65 



ccc gcc gcc ccc tec ttc cgc cag gtg tec tgc ctg aag gag ctg gtg 
Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu Val 
70 75 80 



299 



gcc cga gtg ctg cag agg ctg tgc gag cgc ggc gcg aag aac gtg ctg 347 
Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val Leu 
85 90 95 

gcc ttc ggc ttc gcg ctg ctg gac ggg gcc cgc ggg ggc ccc ccc gag 395 
Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro Glu 
100 105 110 



gcc ttc acc acc age gtg cgc age tac ctg ccc aac acg gtg acc gac 
Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr Asp 
115 120 125 



gac gac gtg ctg gtt cac ctg ctg gca cgc tgc gcg etc ttt gtg ctg 
Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val Leu 
150 155 160 

gtg get ccc age tgc gcc tac cag gtg tgc ggg ccg ccg ctg tac cag 
Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr Gin 
165 170 175 

etc ggc get gcc act cag gcc egg ccc ccg cca cac get agt gga ccc 
Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly Pro 
180 185 190 

cga agg cgt ctg gga tgc gaa egg gcc tgg aac cat age gtc agg gag 
Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg Glu 
195 200 205 



443 



gca ctg egg ggg age ggg gcg tgg ggg ctg ctg ttg cgc cgc gtg ggc 4 91 
Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val Gly 
130 135 140 145 



539 



587 



635 



683 



gcc ggg gtc ccc ctg ggc ctg cca gcc ccg ggt gcg agg agg cgc ggg 731 
Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg Gly 
210 215 220 225 

ggc agt gcc age cga agt ctg ccg ttg ccc aag agg ccc agg cgt ggc 77 9 
Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg Gly 
230 235 240 

get gcc cct gag ccg gag egg acg ccc gtt ggg cag ggg tec tgg gcc 827 
Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp Ala 
245 250 255 

cac ccg ggc agg acg cgt gga ccg agt gac cgt ggt ttc tgt gtg gtg 
His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val Val 
260 265 270 

tea cct gcc aga ccc gcc gaa gaa gcc acc tct ttg gag ggt gcg etc 
Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala Leu 
275 280 285 

tct ggc acg, cgc cac tec cac cca tec gtg ggc cgc cag cac cac gcg 971 



875 



923 



Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His Ala 
290 295 300 305 



ggc ccc cca tec aca teg egg cca cca cgt ccc tgg gac acg cct tgt 
Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro Cys 
310 315 320 

ccc ccg gtg tac gee gag ace aag cac ttc etc tac tec tea ggc gac 
Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly Asp 
325 330 335 

aag gag cag ctg egg ccc tec ttc eta etc age tct ctg agg ccc age 
Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro Ser 
340 345 350 

ctg act ggc get egg agg etc gtg gag acc ate ttt ctg ggt tec agg 
Leu Thr Gly Ala Arg Arg Leu Val Glu Thr He Phe Leu Gly Ser Arg 
355 360 365 

ccc tgg atg cca ggg act ccc cgc agg ttg ccc cgc ctg ccc cag cgc 
Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin Arg 
370 375 380 385 

tac tgg caa atg egg ccc ctg ttt ctg gag ctg ctt ggg aac cac gcg 
Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His Ala 
390 395 400 

cag tgc ccc tac ggg gtg etc etc aag acg cac tgc ccg ctg cga get 
Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg Ala 
405 410 415 

gcg gtc acc cca gca gee ggt gtc tgt gee egg gag aag ccc cag ggc 
Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin Gly 
420 425 430 

tct gtg gcg gee ccc gag gag gag gac aca gac ccc cgt cgc ctg gtg 
Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu Val 
435 440 445 

cag ctg etc cgc cag cac age age ccc tgg cag gtg tac ggc ttc gtg 
Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe Val 
450 455 460 465 

egg gee tgc ctg cgc egg ctg gtg ccc cca ggc etc tgg ggc tec agg 
Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser Arg 
470 475 480 

cac aac gaa cgc cgc ttc etc agg aac acc aag aag ttc ate tec ctg 
His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe He Ser Leu 
485 490 495 

ggg aag cat gee aag etc teg ctg cag gag ctg acg tgg aag atg age 
Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met Ser 
500 505 510 



gtg egg ggc tgc get tgg ctg cgc agg age cca ggg gtt ggc tgt gtt 
Val Arg Gly Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys Val 



515 520 525 

ccg gcc gca gag cac cgt ctg cgt gag gag ate ctg gec aag ttc ctg 
Pro Ala Ala Glu His Arg Leu Arg Glu Glu He Leu Ala Lys Phe Leu 
530 535 540 545 

cac tgg ctg atg agt gtg tac gtc gtc gag ctg etc agg tct ttc ttt 
His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe Phe 
550 555 560 

tat gtc acg gag acc acg ttt caa aag aac agg etc ttt ttc tac egg 
Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr Arg 
565 570 575 

aag agt gtc tgg age aag ttg caa age att gga ate aga cag cac ttg 
Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His Leu 
580 585 590 

aag agg gtg cag ctg egg gag ctg teg gaa gca gag gtc agg cag cat 
Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin His 
595 600 605 

egg gaa gcc agg ccc gcc ctg ctg acg tec aga etc cgc ttc ate ccc 
Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He Pro 
610 615 620 625 

aag cct gac ggg ctg egg ccg att gtg aac atg gac tac gtc gtg gga 
Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val Gly 
630 635 640 

gcc aga acg ttc cgc aga gaa aag agg gcc gag cgt etc acc teg agg 
Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser Arg 
645 650 655 

gtg aag gca ctg ttc age gtg etc aac tac gag egg gcg egg cgc ccc 
Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg Pro 
660 665 670 

ggc etc ctg ggc gcc tct gtg ctg ggc ctg gac gat ate cac agg gcc 
Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg Ala 
675 680 685 

tgg cgc acc ttc gtg ctg cgt gtg egg gcc cag gac ccg ccg cct gag 
Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro Glu 
690 695 700 705 

ctg tac ttt gtc aag gtg gat gtg acg ggc gcg tac gac acc ate ccc 
Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He Pro 
710 715 720 

cag gac agg etc acg gag gtc ate gcc age ate ate aaa ccc cag aac 
Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin Asn 
725 730 735 



acg tac tgc gtg cgt egg tat gcc gtg gtc cag aag gcc gcc cat ggg 
Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His Gly 
740 745 750 



cac gtc cgc 
His Val Arg 
755 

cag ccg tac 
Gin Pro Tyr 
770 

ctg agg gat 
Leu Arg Asp 



age agt ggc 
Ser Ser Gly 



gtg cgc ate 
Val Arg He 
820 

ggc tec ate 
Gly Ser He 
835 

gag aac aag 
Glu Asn Lys 
850 

ttg gtg gat 
Leu Val Asp 



acc ttc etc 
Thr Phe Leu 



gtg aac ttg 
Val Asn Leu 
900 

ctg ggt ggc 
Leu Gly Gly 
915 

tgg tgc ggc 
Trp Cys Gly 
930 

tac tec age 
Tyr Ser Ser 



cgc ggc ttc 
Arg Gly Phe 



aag gec ttc 
Lys Ala Phe 



atg cga cag 
Met Arg Gin 
775 

gec gtc gtc 
Ala Val Val 
790 

etc ttc gac 
Leu Phe Asp 
805 

agg ggc aag 
Arg Gly Lys 



etc tec acg 
Leu Ser Thr 



ctg ttt gcg 
Leu Phe Ala 
855 

gat ttc ttg 
Asp Phe Leu 
870 

agg acc ctg 
Arg Thr Leu 
885 

egg aag aca 
Arg Lys Thr 



acg get ttt 
Thr Ala Phe 



ctg ctg ctg 
Leu Leu Leu 
935 

tat gee egg 
Tyr Ala Arg 
950 

aag get ggg 
Lys Ala Gly 
965 



aag age cac 
Lys Ser His 
760 

ttc gtg get 
Phe Val Ala 



ate gag cag 
He Glu Gin 



gtc ttc eta 
Val Phe Leu 
810 

tec tac gtc 
Ser Tyr Val 
825 

ctg etc tgc 
Leu Leu Cys 
840 

ggg att egg 
Gly He Arg 



ttg gtg aca 
Leu Val Thr 



gtc cga ggt 
Val Arg Gly 
890 

gtg gtg aac 
Val Val Asn 
905 

gtt cag atg 
Val Gin Met 
920 

gat acc egg 
Asp Thr Arg 



acc tec ate 
Thr Ser He 



agg aac atg 
Arg Asn Met 
970 



gtc tct acc 
Val Ser Thr 
765 

cac ctg cag 
His Leu Gin 
780 

age tec tec 
Ser Ser Ser 
795 

cgc ttc atg 
Arg Phe Met 



cag tgc cag 
Gin Cys Gin 



age ctg tgc 
Ser Leu Cys 
845 

egg gac ggg 
Arg Asp Gly 
860 

cct cac etc 
Pro His Leu 
875 

gtc cct gag 
Val Pro Glu 



ttc cct gta 
Phe Pro Val 



ccg gec cac 
Pro Ala His 
925 

acc ctg gag 
Thr Leu Glu 
940 

aga gee agt 
Arg Ala Ser 
955 

cgt cgc aaa 
Arg Arg Lys 



ttg aca gac 
Leu Thr Asp 



gag acc age 
Glu Thr Ser 



ctg aat gag 
Leu Asn Glu 
800 

tgc cac cac 
Cys His His 
815 

ggg ate ccg 
Gly He Pro 
830 

tac ggc gac 
Tyr Gly Asp 



ctg etc ctg 
Leu Leu Leu 



acc cac gcg 
Thr His Ala 
880 

tat ggc tgc 
Tyr Gly Cys 
895 

gaa gac gag 
Glu Asp Glu 
910 

ggc eta ttc 
Gly Leu Phe 



gtg cag age 
Val Gin Ser 



etc acc ttc 
Leu Thr Phe 
960 

etc ttt ggg 
Leu Phe Gly 
975 



etc 2363 
Leu 



ccg 2411 

Pro 

785 

gee 2459 
Ala 



gee 2507 
Ala 



cag 2555 
Gin 



atg 2603 
Met 



cgt 2651 

Arg 

865 

aaa 2699 
Lys 



gtg 2747 
Val 



gec 2795 
Ala 



ccc 2843 
Pro 



gac 2891 

Asp 

945 

aac 2939 
Asn 



gtc 2987 
Val 



ttg egg ctg aag tgt cac age ctg ttt ctg gat ttg cag gtg aac age 3035 
Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn Ser 
980 985 990 

etc cag acg gtg tgc acc aac ate tac aag ate etc ctg ctg cag gcg 3083 
Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin Ala 
995 1000 1005 

tac agg ttt cac gca tgt gtg ctg cag etc cca ttt cat cag caa gtt 3131 
Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin Val 
1010 1015 1020 1025 

tgg aag aac ccc aca ttt ttc ctg cgc gtc ate tct gac acg gee tec 3179 
Trp Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser Asp Thr Ala Ser 
1030 1035 1040 

etc tgc tac tec ate ctg aaa gee aag aac gca ggg atg teg ctg ggg 3227 
Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu Gly 
1045 1050 1055 

gee aag ggc gec gee ggc cct ctg ccc tec gag gee gtg cag tgg ctg 3275 
Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp Leu 
1060 1065 1070 

tgc cac caa gca ttc ctg etc aag ctg act cga cac cgt gtc acc tac 3323 
Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr Tyr 
1075 1080 1085 

gtg cca etc ctg ggg tea etc agg aca gee cag acg cag ctg agt egg 3371 
Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser Arg 
1090 1095 1100 1105 

aag etc ccg ggg acg acg ctg act gee ctg gag gee gca gee aac ccg 3419 
Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn Pro 
1110 1115 1120 

gca ctg ccc tea gac ttc aag acc ate ctg gac tgatggccac ccgcccacag 3472 
Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1125 1130 

ccaggccgag agcagacacc agcagccctg tcacgccggg ctctacgtcc cagggaggga 3532 

ggggcggccc acacccaggc ccgcaccgct gggagtctga ggcctgagtg agtgtttggc 3592 

cgaggcctgc atgtccggct gaaggctgag tgtccggctg aggectgage gagtgtccag 3652 

ccaagggctg agtgtccagc acacctgccg tcttcacttc cccacaggct ggcgctcggc 3712 

tccaccccag ggccagcttt tcctcaccag gagcccggct tccactcccc acataggaat 3772 

agtccatccc cagattcgcc attgttcacc cctcgccctg ccctcctttg ccttccaccc 3832 

ccaccatcca ggtggagacc ctgagaagga ccctgggagc tctgggaatt tggagtgacc 3892 

aaaggtgtgc cctgtacaca ggegaggace ctgcacctgg atgggggtcc ctgtgggtca 3952 

aattgggggg aggtgctgtg ggagtaaaat actgaatata tgagtttttc agttttgaaa 4012 



aaaaaaaaaa aaaaa 



<210> 2 
<211> 1132 
<212> PRT 

<213> Homo sapiens 
<400> 2 

Met Pro Arg Ala Pro Arg Cys Arg Ala Val Arg Ser Leu Leu Arg Ser 
15 10 15 

His Tyr Arg Glu Val Leu Pro Leu Ala Thr Phe Val Arg Arg Leu Gly 
20 25 30 

Pro Gin Gly Trp Arg Leu Val Gin Arg Gly Asp Pro Ala Ala Phe Arg 
35 40 ' 45 

Ala Leu Val Ala Gin Cys Leu Val Cys Val Pro Trp Asp Ala Arg Pro 
50 55 60 

Pro Pro Ala Ala Pro Ser Phe Arg Gin Val Ser Cys Leu Lys Glu Leu 
65 70 75 80 

Val Ala Arg Val Leu Gin Arg Leu Cys Glu Arg Gly Ala Lys Asn Val 
85 90 95 

Leu Ala Phe Gly Phe Ala Leu Leu Asp Gly Ala Arg Gly Gly Pro Pro 
100 105 110 

Glu Ala Phe Thr Thr Ser Val Arg Ser Tyr Leu Pro Asn Thr Val Thr 
115 120 125 

Asp Ala Leu Arg Gly Ser Gly Ala Trp Gly Leu Leu Leu Arg Arg Val 
130 135 140 

Gly Asp Asp Val Leu Val His Leu Leu Ala Arg Cys Ala Leu Phe Val 
145 150 155 160 

Leu Val Ala Pro Ser Cys Ala Tyr Gin Val Cys Gly Pro Pro Leu Tyr 
165 170 175 

Gin Leu Gly Ala Ala Thr Gin Ala Arg Pro Pro Pro His Ala Ser Gly 
180 185 190 

Pro Arg Arg Arg Leu Gly Cys Glu Arg Ala Trp Asn His Ser Val Arg 
195 200 205 

Glu Ala Gly Val Pro Leu Gly Leu Pro Ala Pro Gly Ala Arg Arg Arg 
210 215 220 

Gly Gly Ser Ala Ser Arg Ser Leu Pro Leu Pro Lys Arg Pro Arg Arg 
225 230 235 240 



Gly Ala Ala Pro Glu Pro Glu Arg Thr Pro Val Gly Gin Gly Ser Trp 
245 250 255 



Ala His Pro Gly Arg Thr Arg Gly Pro Ser Asp Arg Gly Phe Cys Val 
260 265 270 



Val Ser Pro Ala Arg Pro Ala Glu Glu Ala Thr Ser Leu Glu Gly Ala 
275 280 285 

Leu Ser Gly Thr Arg His Ser His Pro Ser Val Gly Arg Gin His His 
290 295 300 

Ala Gly Pro Pro Ser Thr Ser Arg Pro Pro Arg Pro Trp Asp Thr Pro 
305 310 315 320 

Cys Pro Pro Val Tyr Ala Glu Thr Lys His Phe Leu Tyr Ser Ser Gly 
325 330 335 

Asp Lys Glu Gin Leu Arg Pro Ser Phe Leu Leu Ser Ser Leu Arg Pro 
340 345 350 

Ser Leu Thr Gly Ala Arg Arg Leu Val Glu Thr lie Phe Leu Gly Ser 
355 360 365 

Arg Pro Trp Met Pro Gly Thr Pro Arg Arg Leu Pro Arg Leu Pro Gin 
370 375 380 

Arg Tyr Trp Gin Met Arg Pro Leu Phe Leu Glu Leu Leu Gly Asn His 
385 390 395 400 

Ala Gin Cys Pro Tyr Gly Val Leu Leu Lys Thr His Cys Pro Leu Arg 
405 410 415 

Ala Ala Val Thr Pro Ala Ala Gly Val Cys Ala Arg Glu Lys Pro Gin 
420 425 430 

Gly Ser Val Ala Ala Pro Glu Glu Glu Asp Thr Asp Pro Arg Arg Leu 
435 440 445 

Val Gin Leu Leu Arg Gin His Ser Ser Pro Trp Gin Val Tyr Gly Phe 
450 455 460 

Val Arg Ala Cys Leu Arg Arg Leu Val Pro Pro Gly Leu Trp Gly Ser 
465 470 475 480 

Arg His Asn Glu Arg Arg Phe Leu Arg Asn Thr Lys Lys Phe lie Ser 
485 490 495 

Leu Gly Lys His Ala Lys Leu Ser Leu Gin Glu Leu Thr Trp Lys Met 
500 505 510 

Ser Val Arg Gly Cys Ala Trp Leu Arg Arg Ser Pro Gly Val Gly Cys 
515 520 525 



Val Pro Ala Ala Glu His Arg Leu Arg Glu Glu lie Leu Ala Lys Phe 
530 535 540 



Leu His Trp Leu Met Ser Val Tyr Val Val Glu Leu Leu Arg Ser Phe 
545 550 555 560 



Phe Tyr Val Thr Glu Thr Thr Phe Gin Lys Asn Arg Leu Phe Phe Tyr 
565 570 575 



Arg Lys Ser Val Trp Ser Lys Leu Gin Ser He Gly He Arg Gin His 
580 585 590 

Leu Lys Arg Val Gin Leu Arg Glu Leu Ser Glu Ala Glu Val Arg Gin 
595 600 605 

His Arg Glu Ala Arg Pro Ala Leu Leu Thr Ser Arg Leu Arg Phe He 
610 615 620 

Pro Lys Pro Asp Gly Leu Arg Pro He Val Asn Met Asp Tyr Val Val 
625 630 635 640 

Gly Ala Arg Thr Phe Arg Arg Glu Lys Arg Ala Glu Arg Leu Thr Ser 
645 650 655 

Arg Val Lys Ala Leu Phe Ser Val Leu Asn Tyr Glu Arg Ala Arg Arg 
660 665 670 

Pro Gly Leu Leu Gly Ala Ser Val Leu Gly Leu Asp Asp He His Arg 
675 680 685 

Ala Trp Arg Thr Phe Val Leu Arg Val Arg Ala Gin Asp Pro Pro Pro 
690 695 700 

Glu Leu Tyr Phe Val Lys Val Asp Val Thr Gly Ala Tyr Asp Thr He 
705 710 715 720 

Pro Gin Asp Arg Leu Thr Glu Val He Ala Ser He He Lys Pro Gin 
725 730 735 

Asn Thr Tyr Cys Val Arg Arg Tyr Ala Val Val Gin Lys Ala Ala His 
740 745 750 

Gly His Val Arg Lys Ala Phe Lys Ser His Val Ser Thr Leu Thr Asp 
755 760 765 

Leu Gin Pro Tyr Met Arg Gin Phe Val Ala His Leu Gin Glu Thr Ser 
770 775 780 

Pro Leu Arg Asp Ala Val Val He Glu Gin Ser Ser Ser Leu Asn Glu 
785 790 795 800 

Ala Ser Ser Gly Leu Phe Asp Val Phe Leu Arg Phe Met Cys His His 
805 810 815 

Ala Val Arg He Arg Gly Lys Ser Tyr Val Gin Cys Gin Gly He Pro 
820 825 830 



Gin Gly Ser He Leu Ser Thr Leu Leu Cys Ser Leu Cys Tyr Gly Asp 
835 840 845 



Met Glu Asn Lys Leu Phe Ala Gly He Arg Arg Asp Gly Leu Leu Leu 
850 855 860 



Arg Leu Val Asp Asp Phe Leu Leu Val Thr Pro His Leu Thr His Ala 
865 870 875 880 



Lys Thr Phe Leu Arg Thr Leu Val Arg Gly Val Pro Glu Tyr Gly Cys 
885 890 895 

Val Val Asn Leu Arg Lys Thr Val Val Asn Phe Pro Val Glu Asp Glu 
900 905 910 

Ala Leu Gly Gly Thr Ala Phe Val Gin Met Pro Ala His Gly Leu Phe 
915 920 925 

Pro Trp Cys Gly Leu Leu Leu Asp Thr Arg Thr Leu Glu Val Gin Ser 
930 935 940 

Asp Tyr Ser Ser Tyr Ala Arg Thr Ser lie Arg Ala Ser Leu Thr Phe 
945 950 955 960 

Asn Arg Gly Phe Lys Ala Gly Arg Asn Met Arg Arg Lys Leu Phe Gly 
965 970 975 

Val Leu Arg Leu Lys Cys His Ser Leu Phe Leu Asp Leu Gin Val Asn 
980 985 990 

Ser Leu Gin Thr Val Cys Thr Asn lie Tyr Lys lie Leu Leu Leu Gin 
995 1000 1005 

Ala Tyr Arg Phe His Ala Cys Val Leu Gin Leu Pro Phe His Gin Gin 
1010 1015 1020 

Val Trp Lys Asn Pro Thr Phe Phe Leu Arg Val lie Ser Asp Thr Ala 
1025 1030 1035 1040 

Ser Leu Cys Tyr Ser lie Leu Lys Ala Lys Asn Ala Gly Met Ser Leu 
1045 1050 1055 

Gly Ala Lys Gly Ala Ala Gly Pro Leu Pro Ser Glu Ala Val Gin Trp 
1060 1065 1070 

Leu Cys His Gin Ala Phe Leu Leu Lys Leu Thr Arg His Arg Val Thr 
1075 1080 1085 

Tyr Val Pro Leu Leu Gly Ser Leu Arg Thr Ala Gin Thr Gin Leu Ser 
1090 1095 1100 

Arg Lys Leu Pro Gly Thr Thr Leu Thr Ala Leu Glu Ala Ala Ala Asn 
1105 1110 1115 1120 



Pro Ala Leu Pro Ser Asp Phe Lys Thr lie Leu Asp 
1125 1130 



<210> 3 
<211> 2372 
<212> DNA 



<213> Homo sapiens 



<400> 3 

gcaccgcgcg 

gcaagaagcc 

cagccaggag 

gagagtggaa 

aaggaaactg 

ggagcaggca 

cacagattcc 

tattaaagtc 

ttggccagta 

gttcaaatga 

ggaaaatata 

actcaggtac 

accttgtaca 

ctacctcatc 

gtgaacgaca 

tggctctgtg 

ggacgccatc 

atcaggattc 

aagattatag 

aagttactgt 

tttccttagc 

cacattgcaa 

ataaagggga 

ttgatgttcc 

aaaatgatga 

catcaacttc 

aagaaaccca 

cttgtgtgat 

atcttatggc 

tatgtagaca 

aagagaatta 

cacatatatc 

tgacctactt 

catcctttac 

ttggtttttt 

agaccgagtc 

agctctgccc 

tacagtcatc 

accgtgttag 

caaagtgctg 



agcttggctg 
gagcccgagg 
caccgtccct 
tgatccccga 
gggagtcttg 
aatgtgcaat 
agcttcggaa 
tgttggtgca 
tattatgact 
tcttctagga 
taccatgatc 
atctgtgagt 
agagcttcag 
tagaaggaga 
aagaaaacgc 
tgtaataagg 
gaatccggat 
agtttcagat 
ccttagtgaa 
gtatcaggca 
tgactattgg 
cagatgttgg 
aatctctgag 
tgattgtaaa 
taaaattaca 
tagtagcatt 
agacaaagaa 
ttgtcaaggt 
ctgctttaca 
accaattcaa 
tatatttcta 
aaagtgagaa 
tggtagtgga 
accaactcct 
ttttcttaaa 
ttgctctgtt 
tccccgggtt 
tgccaccaca 
ccaggatggt 
ggattacagg 



cttctggggc 
ggcggccgcg 
ccccggatta 
ggcccagggc 
agggaccccc 
accaacatgt 
caagagaccc 
caaaaagaca 
aaacgattat 
gatttgtttg 
tacaggaact 
gagaacaggt 
gaagagaaac 
gcaattagtg 
cacaaatctg 
gagatatgtt 
cttgatgctg 
cagtttagtg 
gaaggacaag 
ggggagagtg 
aaatgcactt 
gcccttcgtg 
aaagccaaac 
aaaactatag 
caagcttcac 
atttatagca 
gagagtgtgg 
cgacctaaaa 
tgtgcaaaga 
atgattgtgc 
actatataac 
aatgcctcaa 
atagtgaata 
aattttaaat 
tatgtatatg 
acccaggctg 
cgcaccattc 
cctggctaat 
ctcgatctcc 
catgagccac 



ctgtgtggcc 
acccctctga 
gtgcgtacga 
gtcgtgcttc 
gactccaagc 
ctgtacctac 
tggttagacc 
cttatactat 
atgatgagaa 
gcgtgccaag 
tggtagtagt 
gtcaccttga 
cttcatcttc 
agacagaaga 
atagtatttc 
gtgaaagaag 
gtgtaagtga 
tagaatttga 
aactctcaga 
atacagattc 
catgcaatga 
agaattggct 
tggaaaactc 
tgaatgattc 
aatcacaaga 
gccaagaaga 
aatctagttt 
atggttgcat 
agctaaagaa 
taacttattt 
cctaggaatt 
ttcacataga 
cttactataa 
aatttctact 
acatttaaat 
gagtgcagtg 
tcctgcctca 
tttttgtact 
tgacctcgtg 
eg 



ctgtgtgtcg 
ccgagatcct 
gcgcccagtg 
cgcagtagtc 
gcgaaaaccc 
tgatggtgct 
aaagccattg 
gaaagaggtt 
gcaacaacat 
cttctctgtg 
caatcagcag 
aggtgggagt 
acatttggtt 
aaattcagat 
cctttccttt 
cagtagcagt 
acattcaggt 
agttgaatct 
tgaagatgat 
atttgaagaa 
aatgaatccc 
tcctgaagat 
aacacaagct 
cagagagtca 
aagtgaagac 
tgtgaaagag 
gccccttaat 
tgtccatggc 
aaggaataag 
cccctagttg 
tagacaacct 
tttcttctct 
tttgacttga 
ctgtcttaaa 
gtaacttatt 
ggtgatcttg 
gcctcccaat 
tttagtagag 
atccgcccac 



gaaagatgga 
getgettteg 
ccctggcccg 
agtccccgtg 
cggatggtga 
gtaaccacct 
cttttgaagt 
cttttttatc 
attgtatatt, 
aaagagcaca 
gaatcategg 
gatcaaaagg 
tctagaccat 
gaattatctg 
gatgaaagee 
gaatctacag 
gattggttgg 
ctcgactcag 
gaggtatatc 
gatcctgaaa 
ccccttccat 
aaagggaaag 
gaagagggct 
tgtgttgagg 
tattctcagc 
tttgaaaggg 
gecattgaac 
aaaacaggac 
ccctgcccag 
acctgtctat 
gaaatttatt 
ttagtataat 
atatgtagct 
tgagaagtac 
attttttttg 
gctcactgca 
tagcttggee 
acagggtttc 
ctcggcctcc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2372 



<210> 4 
<211> 491 
<212> PRT 

<213> Homo sapiens 
<400> 4 

Met Cys Asn Thr Asn Met Ser Val Pro Thr Asp Gly Ala Val Thr Thr 
15 10 15 



Ser Gin lie Pro Ala Ser Glu Gin Glu Thr Leu Val Arg Pro Lys Pro 
20 25 30 



Leu Leu Leu Lys Leu Leu Lys Ser Val Gly Ala Gin Lys Asp Thr Tyr 
35 40 45 



Thr Met Lys Glu Val Leu Phe Tyr Leu Gly Gin Tyr lie Met Thr Lys 
50 55 60 

Arg Leu Tyr Asp Glu Lys Gin Gin His lie Val Tyr Cys Ser Asn Asp 
65 70 75 80 

Leu Leu Gly Asp Leu Phe Gly Val Pro Ser Phe Ser Val Lys Glu His 
85 90 95 

Arg Lys lie Tyr Thr Met lie Tyr Arg Asn Leu Val Val Val Asn Gin 
100 . 105 110 

Gin Glu Ser Ser Asp Ser Gly Thr Ser Val Ser Glu Asn Arg Cys His 
115 120 125 

Leu Glu Gly Gly Ser Asp Gin Lys Asp Leu Val Gin Glu Leu Gin Glu 
130 135 140 

Glu Lys Pro Ser Ser Ser His Leu Val Ser Arg Pro Ser Thr Ser Ser 
145 150 155 160 

Arg Arg Arg Ala lie Ser Glu Thr Glu Glu Asn Ser Asp Glu Leu Ser 
165 170 175 

Gly Glu Arg Gin Arg Lys Arg His Lys Ser Asp Ser lie Ser Leu Ser 
180 185 190 

Phe Asp Glu Ser Leu Ala Leu Cys Val lie Arg Glu lie Cys Cys Glu 
195 200 205 

Arg Ser Ser Ser Ser Glu Ser Thr Gly Thr Pro Ser Asn Pro Asp Leu 
210 215 220 

Asp Ala Gly Val Ser Glu His Ser Gly Asp Trp Leu Asp Gin Asp Ser 
225 230 235 240 



Val Ser Asp Gin Phe Ser Val Glu Phe Glu Val Glu Ser Leu Asp Ser 
245 250 255 



Glu Asp Tyr Ser 
260 

Asp Glu Val Tyr 
275 

Asp Ser Phe Glu 
290 

Cys Thr Ser Cys 
305 

Arg Cys Trp Ala 



Leu Ser Glu Glu 



Gin Val Thr Val 
280 

Glu Asp Pro Glu 
295 

Asn Glu Met Asn 
310 

Leu Arg Glu Asn 
325 



Gly Gin Glu Leu 
265 

Tyr Gin Ala Gly 



lie Ser Leu Ala 
300 

Pro Pro Leu Pro 
315 

Trp Leu Pro Glu 
' 330 



Ser Asp Glu Asp 
270 

Glu Ser Asp Thr 
285 

Asp Tyr Trp Lys 



Ser His Cys Asn 
320 

Asp Lys Gly Lys 
335 



Asp Lys Gly Glu lie Ser Glu Lys Ala Lys Leu Glu Asn Ser Thr Gin 
340 345 350 



Ala Glu Glu Gly 
355 

Asp Ser Arg Glu 
370 

Ala Ser Gin Ser 
385 

Ser Ser lie lie 



Glu Glu Thr Gin 
420 

Asn Ala lie Glu 
435 

Cys lie Val His 
450 

Ala Lys Lys Leu 
465 

Pro He Gin Met 



Phe Asp Val Pro 
360 

Ser Cys Val Glu 
375 

Gin Glu Ser Glu 
390 

Tyr Ser Ser Gin 
405 

Asp Lys Glu Glu 



Pro Cys Val He 
440 

Gly Lys Thr Gly 
455 

Lys Lys Arg Asn 
470 

He Val Leu Thr 
485 



Asp Cys Lys Lys 



Glu Asn Asp Asp 
380 

Asp Tyr Ser Gin 
395 

Glu Asp Val Lys 
410 

Ser Val Glu Ser 
425 

Cys Gin Gly Arg 



His Leu Met Ala 
460 

Lys Pro Cys Pro 
475 

Tyr Phe Pro 
490 



Thr He Val Asn 
365 

Lys He Thr Gin 



Pro Ser Thr Ser 
400 

Glu Phe Glu Arg 
415 

Ser Leu Pro Leu 
430 

Pro Lys Asn Gly 
445 

Cys Phe Thr Cys 



Val Cys Arg Gin 
480 



<210> 5 

<211> 4233 

<212> DRA 

<213> Homo sapiens 



<220> 
<223> M n" 



at various positions may be a, t, c, g, other or unknown 



<400> 5 

ccctcctccc 

cccagaacgt 

tcacatggtg 

ggccctctgc 

tccggggccc 

gtcccttccc 

tcgcgagctg 

gaaagtataa 

gatctttgtg 

cgtggctgga 

actgggagcg 

ttgttgctgc 

gattgtaggg 

attggtgtcg 

gtggccctca 

agcacagttc 

gtccggtgag 



agtcgaagca 
ccggcgttcg 
agggtggggg 
gtccagctgc 
cgataacggg 
aggaatgaga 
gttatggaag 
ttttagggac 
cggggcgcga 
ggggttggta 
ggggatggga 
aggtcatacc 
tctcccttga 
gtgcctatgg 
agagtgtgag 
gtgaggtggc 
aaggtggtgg 



cctcctgtcc 
ccccgccctc 
tgagggggcc 
tccggaccga 
ccgcccccac 
accagtgccc 
ggtcgctcaa 
tgaggtgtag 
ttgtcccaaa 
ttgagtcatt 
tgctggtggt 
atcctaactc 
tctgagaatg 
gacagtgtac 
agtccccaat 
tttactgagg 
agggttgggc 



gcccctcagc 
ccagtttccg 
tctctagctt 
gctcgggtgt 
agcaccccgg 
gcccccctca 
gggcgggaag 
gatcttcgat 
ggaaaaagcg 
gtgttatctc 
gttctttgcg 
tgtaagcgac 
gctacctctc 
aaggcccgtg 
ggaggaggag 
cgactggagg 
gtggggagta 



gcatgggtgg 
cgcgcctctt 
gcggcctgtg 
atggggccgt 
gctggcgtga 
cagctttcca 
tggggccttt 
gcaaggcatg 
ttttctattg 
tggggccggc 
cttttttttt 
ttttggtgat 
gatatgagcc 
atccccacag 
gtggaggagg 
cttttgagca 
aagggaaaag 



cggtcacgtg 
tggcagctgg 
tctatggtcg 
aggaaccggc 
ggtaagtgca 
cgcgttcgtt 
gtggtcatgg 
tgtcatgtgt 
cagggcctca 
cccaaggaag 
gggagtccct 
aggagtctgt 
agtggctgaa 
tggccacttt 
ccttcccatc 
tcccaatgtt 
acagcctata 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 



ggtggggtgt gatgatctgt agagaagtgg 
tgggttaaag gggattgaaa agtgagcatt 
cacatcccga actgaccggg agatcaaggt 
cctaaggaca tatctggaca aggcaccccc 
gagtggggtt ggtaggcatt gagaggtgga 
tttcaggtat ggtgcctagt ttccagtgca 
gatgcgccag tttctaagag gcctagattt 
tctgaagcca gagaacattc tggtgacaag 
cctggccaga atctacagct accagatggc 
accaaaatgg gttctggttg ggaataggag 
gtgcttcatg tgttcagtca agcaagttgt 
attcttccta ttccctttag gttgttacac 
agtccacata tgcaacacct gtggacatgt 
ttcgtcgaaa gtatgggacc cacataccct 
tcataaacca catccatacc ttgcccattc 
gcccaggctg gattgcaatg gcatgatcac 
aagtgatcct cccatctcag cttcccaact 
ttggctagtt tgttaatatt tttatagaga 
ttgaactctt gcactcaagc aatcctccca 
gcatctggcc ccattctttt acttgcgtac 
aatgttctgg aattaggaac ttcatatccc 
tatttattta tttatttatt gagataaggt 
ggcccaatta nagctcactg tancctctac 
gccccttgag tanctgagac taaaggtgca 
gatggagtct tgctctgtcg ccaggctgga 
cctccacctc ccagattcaa gcaattctct 
aggtgcacgc caccatgctc agctaatttt 
gttggccagg atggtctcga tctcttgacc 
tgctaggatt acaggcgtga gcnnnngcac 
caaggtcttg cttgcccagg ctgatctaga 
agcatcccaa agtgctggaa ttgttggcct 
taagcagtaa ccattcaact aagatatctt 
caccttaggc ctctcttctg tggaaactct 
taagtgacca acatgggaga aaaagatttt 
ggatggcaac tggctctgcc atggggatgg 
cctgtccccc acagcctgat tgggctgcct 
ctgccccgtg gagcctttcc ccccagaggg 
atggaggagt cgggagcaca gctgctgctg 
agaaatagag actggggaaa gaaatagagc 
tgaaacttgg ttggtagatg gtctgtagtt 
agaaggatat agaagagaac ccatcctggc 
gcactttggg aggccaaggt gggcgtatca 
ccaacatggt gaaacctcgt ctctactaaa 
acgcctgtaa tctcagctac ttgggaggct 
cggaggttgc agtgagctga gatcatacca 
actccgtctn caaaaaaaaa aaaaaagaaa 
tgggtggcag aagctgtttt cttcatggtt 
gctgactttt aacccacaca agcgaatctc 
acataaggat gaaggtaatc cggagtgagc 
gctgccattt cccttctgga cactgagagg 
gagggtcctc ctccatcttt ctacagagat 
cacctctcct tttgaggctt ctccttctcc 
ccctcttgtc cctttcccta cctttatatt 
accaaaagaa awaatggccc tttttttttt 



ggaccctgag gaaataatga gaggccatgt 1080 
tactctggtc aggctgatgg acgtctgtgc 1140 
aaccctggtg tttgagcatg tagaccagga 1200 
accaggcttg ccagccgaaa cgatcaaggt 1260 
ttgggacctt tgtagtagaa ccttctggga 1320 
tctgtacctc cccctttgaa actaggatct 1380 
ccttcatgcc aattgcatcg ttcaccgaga 1440 
tggtggaaca gtcaagctgg ctgactttgg 1500 
acttacaccc gtggtcagta gaaagatggt 1560 
agtgattgcc cgtagcaatt gagaagtcat 1620 
gtttcatggt aacccatggg gtccccatcc 1680 
tctggtaccg agctcccgaa gttcttctgc 1740 
ggagtgttgg ctgtatcttt gcagagatgt 1800 
ggactacctt gaattcccca aatcgcttgt 1860 
tttttttttg agaccagggc ttgctgtgtt 1920 
agctcactgc agcttcaacc tcctgggctc 1980 
agctgacact acaggcacgc acctccatgc 2040 
tggggtctca gtatattgcc caggctggtc 2100 
cccctacctc ccaaagtagc ataagctact 2160 
tactaacttg cccatagcag aaagctctga 2220 
tttattctct ttatttttta tttatttatt 2280 
ttcactctgn nacccaggct ggagtncagt 2340 
ctcctgggct aaagmaatcc tcccatctca 2400 
cgccaccatg actggctttt ttttttttta 24 60 
gtgcagtagt gcgatctctg ctcactgcaa 2520 
tgactcagcc tcccaagtag ctgggaccac 2580 
tgtactttta gtaatgacag gtttcaccat 2640 
tcatgatcca cccacatcag actcccaaag 2700 
ctggcatttc ttttttttta aaaaaagaga 2760 
actcctgggc tcaagcagtc ctctcacctc 2820 
ttattcccta tacttcctat tttgagccac 2880 
tgaaaatgac tgctacctta tatcccttct 2940 
gaagccgacc agttgggcaa aatctttgag 3000 
ctattctgag tcctctttct gctgaaccca 3060 
gaactggagg accctcctga ccagagttct 3120 
ccagaggatg actggcctcg agatgtatcc. 3180 
ccccgcccag tgcagtcggt ggtacctgag 3240 
gtaactggag atggctgtgg gcacagggaa 3300 
agtatgcagg gccctggcca ctgtggttaa 3360 
tttattacag ctgcaaatag ccacccacag 3420 
tgggcacggt ggctcacgcc tgtaatccca 3480 
cctgaggtca ggagttcgag accagcctgg 3540 
agtacaaaaa taagccgggg gtggtggcac 3600 
gagataggag aatcacttca actcaggagg 3660 
ttggcactcc agcctgggtg atagagcgag 3720 
aaagaagaaa gctcatccca ggtattgttg 3780 
ttctgacctt tgcctctccc ctcaggaaat 3840 
tgcctttcga gctctgcagc actcttatct 3900 
aatggagtgg ctgccatgga aggaagaaaa 3960 
"gcaatctttg cctttatctc tgaggctatg 4020 
tactttgctg ccttaatgac attcccctcc 4080 
ttcccatttc tctacactaa ggggtatgtt 4140 
tggggtcctt ttttatacag gaaaaacaaa 4200 
ttt 4233 



<210> 6 



<211> 303 
<212> PRT 

<213> Homo sapiens 
<400> 6 

Met Ala Thr Ser Arg Tyr Glu Pro Val Ala Glu He Gly Val Gly Ala 
15 10 15 

Tyr Gly Thr Val Tyr Lys Ala Arg Asp Pro His Ser Gly His Phe Val 
20 25 30 

Ala Leu Lys Ser Val Arg Val Pro Asn Gly Gly Gly Gly Gly Gly Gly 
35 40 45 

Leu Pro He Ser Thr Val Arg Glu Val Ala Leu Leu Arg Arg Leu Glu 
50 55 60 

Ala Phe Glu His Pro Asn Val Val Arg Leu Met Asp Val Cys Ala Thr 
65 70 75 80 

Ser Arg Thr Asp Arg Glu He Lys Val Thr Leu Val Phe Glu His Val 
85 90 95 

Asp Gin Asp Leu Arg Thr Tyr Leu Asp Lys Ala Pro Pro Pro Gly Leu 
100 105 110 

Pro Ala Glu Thr He Lys Asp Leu Met Arg Gin Phe Leu Arg Gly Leu 
115 120 125 

Asp Phe Leu His Ala Asn Cys He Val His Arg Asp Leu Lys Pro Glu 
130 135 140 

Asn He Leu Val Thr Ser Gly Gly Thr Val Lys Leu Ala Asp Phe Gly 
145 150 155 160 

Leu Ala Arg He Tyr Ser Tyr Gin Met Ala Leu Thr Pro Val Val Val 
165 170 175 

Thr Leu Trp Tyr Arg Ala Pro Glu Val Leu Leu Gin Ser Thr Tyr Ala 
180 185 190 

Thr Pro Val Asp Met Trp Ser Val Gly Cys He Phe Ala Glu Met Phe 
195 200 205 

Arg Arg Lys Pro Leu Phe Cys Gly Asn Ser Glu Ala Asp Gin Leu Gly 
210 215 220 

Lys He Phe Asp Leu He Gly Leu Pro Pro Glu Asp Asp Trp Pro Arg 
225 230 235 240 

Asp Val Ser Leu Pro Arg Gly Ala Phe Pro Pro Arg Gly Pro Arg Pro 
245 250 255 

Val Gin Ser Val Val Pro Glu Met Glu Glu Ser Gly Ala Gin Leu Leu 
260 265 270 



Leu Glu Met Leu Thr Phe Asn Pro His Lys Arg He Ser Ala Phe Arg 



275 



280 



285 



Ala Leu Gin His Ser Tyr Leu His Lys Asp Glu Gly Asn Pro Glu 
290 295 300 



<210> 7 
<211> 105 
<212> PRT 

<213> Homo sapiens 
<400> 7 

Met His Gly Pro Lys Ala Thr Leu Gin Asp lie Val Leu His Leu Glu 
15 10 15 

Pro Gin Asn Glu lie Pro Val Asp Leu Leu Cys His Glu Gin Leu Ser 
20 25 30 

Asp Ser Glu Glu Glu Asn Asp Glu lie Asp Gly Val Asn His Gin His 
35 40 45 

Leu Pro Ala Arg Arg Ala Glu Pro Gin Arg His Thr Met Leu Cys Met 
50 55 60 

Cys Cys Lys Cys Glu Ala Arg lie Glu Leu Val Val Glu Ser Ser Ala 
65 70 75 80 

Asp Asp Leu Arg Ala Phe Gin Gin Leu Phe Leu Asn Thr Leu Ser Phe 
85 90 95 

Val Cys Pro Trp Cys Ala Ser Gin Gin 
100 105 



<210> 8 

<211> 4839 

<212> DNA 

<213> Homo sapiens 



<400> 8 

tccggttttt 

gcgtgccccg 

cgccgcggaa 

ctgccgccgc 

aggacagcgg 

ctgattttac 

ggttaacttg 

aaaaggaact 

tcacttttac 

taaaagaaat 

atgatgtatt 

cacaacccag 

ggatcacatt 

catttcagtt 

tcaaagaacc 

gaggtcagaa 

aagttctctg 



ctcaggggac 
cgtgcgcgcg 
aggcgtcatg 
ggaacccccg 
cccggaggac 
tgcattatgt 
ggagaaagtt 
gtggggaatc 
tgagctacag 
tgataccagt 
gtttgcactc 
cagttcgata 
tttattagct 
aatgctatgt 
atataaaaca 
caggagtgca 
taaagaacat 



gttgaaatta 
cgtcgtcctc 
ccgcccaaaa 
gcaccgccgc 
ctgcctctcg 
cagaaattaa 
tcatctgtgg 
tgtatcttta 
aaaaacatag 
accaaagttg 
ttcagcaaat 
tctactgaaa 
aaaggggaag 
gtccttgact 
gctgttatac 
cggatagcaa 
gaatgtaata 



tttttgtaac 
cccggcgctc 
ccccccgaaa 
cgccgccccc 
tcaggcttga 
agataccaga 
atggagtatt 
ttgcagcagt 
aaatcagtgt 
ataatgctat 
tggaaaggac 
taaattctgc 
tattacaaat 
attttattaa 
ccattaatgg 
aacaactaga 
tagatgaggt 



gggagtcggg 
ctccacagct 
aacggccgcc 
tcctgaggag 
gtttgaagaa 
tcatgtcaga 
gggaggttat 
tgacctagat 
ccataaattc 
gtcaagactg 
atgtgaactt 
attggtgcta 
ggaagatgat 
actctcacct 
ttcacctcga 
aaatgataca 
gaaaaatgtt 



agaggacggg 
cgctggctcc 
accgccgccg 
gacccagagc 
acagaagaac 
gagagagctt 
attcaaaaga 
gagatgtcgt 
tttaacttac 
ttgaagaagt 
atatatttga 
aaagtttctt 
ctggtgattt 
cccatgttgc 
acacccaggc 
agaattattg 
tatttcaaaa 



60 
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240 

300 
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1020 



attttatacc ttttatgaat tctcttggac 
aaaatctttc taaacgatac gaagaaattt 
tatttttgga tcatgataaa actcttcaga 
gaacaccacg aaaaagtaac cttgatgaag 
ttaggactgt tatgaacact atccaacaat 
aaccttcaga aaatctgatt tcctatttta 
tactgaaaag agtgaaggat ataggataca 
gacagggttg tgtcgaaatt ggatcacagc 
gagtaatgga atccatgctt aaatcagaag 
aacttctgaa tgacaacatt tttcatatgt 
tggccacata tagcagaagt acatctcaga 
catggattct gaatgtgctt aatttaaaag 
ttatcaaagc agaaggcaac ttgacaagag 
atcgaatcat ggaatccctt gcatggctct 
aatcaaagga ccgagaagga ccaactgatc 
ctctccagaa taatcacact gcagcagata 
aaaaaggttc aactacgcgt gtaaattcta 
ccttccagac ccagaagcca ttgaaatcta 
atcggctagc ctatctccgg ctaaatacac 
aattagaaca tatcatctgg acccttttcc 
tgagagacag gcatttggac caaattatga 
agaatataga ccttaaattc aaaatcattg 
ttcaggagac attcaaacgt gttttgatca 
tctataactc ggtcttcatg cagagactga 
ggccccctac cttgtcacca atacctcaca 
cacccttacg gattcctgga gggaacatct 
tttcagaagg tctgccaaca ccaacaaaaa 
ttggtgaatc attcgggact tctgagaagt 
gcgaccgtgt gctcaaaaga agtgctgaag 
tacgctttga tattgaagga tcagatgaag 
ccaaatttca gcagaaactg gcagaaatga 
aaatgaatga tagcatggat acctcaaaca 
gtggacactg tgtacacctc tggattcatt 
ccaggttctg tttatggcca catttaatat 
cagatgcaat tgtttgggtg attcctaagc 
caagattgaa aatcttgtgt aaatcctgcc 
ttccaaagta aaattgctgt gctttatgga 
gataacccag gcctgtctga ctactttgcc 
cttgttttta ttaatttata tgtatatttt 
gtgtcctatc tatcttccaa atgcaatttg 
aactcttctg caaaaatgga tattattaga 
tagattttat tttactattg gaatctgata 
tttaattaaa taaaagctgg aagcaaagta 
cagatttcat acctcagaat gtaaaagaac 
tttttaaatg aggattattg atagtactct 
tataaagtac ccatctagta cttgaaaaag 
ggaccctaac acagtatatc ccaagtgcac 
agatacaaat taattttact ccataaacag 
ttcatagaga tttgtctaat tgcatctcaa 
tttgtgtttt ctctggaatg gtacatgtct 
atttatcttt tattttttta agtcagtatg 
atttctagtc caaaattaca agtaatcaag 
atctgatttt gtgcaaaagc ttcaaattaa 
cccctcccct acacctaaag gtgtatttaa 
tgctgtaact taaaataggg gatatttaag 
tttgtctaac tcagaattat ttttaaaaag 
attttgtgct catttaagtt tcaaacttac 



ttgtaacatc taatggactt ccagaggttg 1080 
atcttaaaaa taaagatcta gatgcaagat 1140 
ctgattctat agacagtttt gaaacacaga 1200 
aggtgaatgt aattcctcca cacactccag 1260 
taatgatgat tttaaattca gcaagtgatc 1320 
acaactgcac agtgaatcca aaagaaagta 1380 
tctttaaaga gaaatttgct aaagctgtgg 1440 
gatacaaact tggagttcgc ttgtattacc 1500 
aagaacgatt atccattcaa aattttagca 1560 
ctttattggc gtgcgctctt gaggttgtaa 1620 
atcttgattc tggaacagat ttgtctttcc 1680 
cctttgattt ttacaaagtg atcgaaagtt 1740 
aaatgataaa acatttagaa cgatgtgaac 1800 
cagattcacc tttatttgat cttattaaac 1860 
accttgaatc tgcttgtcct cttaatcttc 1920 
tgtatctttc tcctgtaaga tctccaaaga 1980 
ctgcaaatgc agagacacaa gcaacctcag 2040 
cctctctttc actgttttat aaaaaagtgt 2100 
tttgtgaacg ccttctgtct gagcacccag 2160 
agcacaccct gcagaatgag tatgaactca 2220 
tgtgttccat gtatggcata tgcaaagtga 2280 
taacagcata caaggatctt cctcatgctg 2340 
aagaagagga gtatgattct attatagtat 2400 
aaacaaatat tttgcagtat gcttccacca 2460 
ttcctcgaag cccttacaag tttcctagtt 2520 
atatttcacc cctgaagagt ccatataaaa 2580 
tgactccaag atcaagaatc ttagtatcaa 2640 
tccagaaaat aaatcagatg gtatgtaaca 2700 
gaagcaaccc tcctaaacca ctgaaaaaac 2760 
cagatggaag taaacatctc ccaggagagt 2820 
cttctactcg aacacgaatg caaaagcaga 2880 
aggaagagaa atgaggatct caggaccttg 2940 
gtctctcaca gatgtgactg tataactttc 3000 
cttcagctct ttttgtggat ataaaatgtg 3060 
cacttgaaat gttagtcatt gttatttata 3120 
atttaaaaag ttgtagcaga ttgtttcctc 3180 
tagtaagaat ggccctagag tgggagtcct 3240 
ttcttttgta gcatataggt gatgtttgct 3300 
tttaatttaa catgaacacc cttagaaaat 3360 
attgactgcc cattcaccaa aattatcctg 3420 
aattagaaaa aaattactaa ttttacacat 3480 
tactgtgtgc ttgttttata aaattttgct 3540 
taaccatatg atactatcat actactgaaa 3600 
ttactgatta ttttcttcat ccaacttatg 3660 
tggtttttat accattcaga tcactgaatt 3720 
taaagtgttc tgccagatct taggtataga 3780 
tttctaatgt ttctgggtcc tgaagaatta 3840 
actgttaatt ataggagcct taattttttt 3900 
aattattctg ccctccttaa tttgggaagg 3960 
tccatgtatc ttttgaactg gcaattgtct 4020 
gtctaacact ggcatgttca aagccacatt 4080 
ggtcattatg ggttaggcat taatgtttct 4140 
aacagctgca ttagaaaaag aggcgcttct 4200 
actatcttgt gtgattaact tatttagaga 4260 
gtagcttcag ctagctttta ggaaaatcac 4320 
aaatctggtc ttgttagaaa acaaaatttt 4380 
tattttgaca gttattttga taacaatgac 4440 



actagaaaac ttgactccat ttcatcattg 
gtttttaggt caagggctta ctatttctgg 
agtgccagaa ttttaggaac ttcagagatc 
gatattattg ctttattgct tttttgtatt 
ttactatttt ctacaattaa tagtttgtct 
taatggtctg atgttgtgtt ctttgtatta 
agaagataga tagaagataa ctctcctagt 



tttctgcatg aatatcatac aaatcagtta 4500 
gtcttttgct actaagttca cattagaatt 4560 
gtgtattgag atttcttaaa taatgcttca 4 620 
ggttaaaact gtacatttaa aattgctatg 4 680 
attttaaaat aaattagttg ttaagagtct 4740 
agtacactaa tgttctcttt tctgtctagg 4800 
atctcatcc 4839 



<210> 9 

<211> 987 

<212> DNA 

<213> Homo sapiens 

<400> 9 

cggagagggg gagaacagac aacgggcggc 
cagcatggag ccttcggctg actggctggc 
ggtgcgggcg ctgctggagg cgggggcgct 
gccgatccag gtcatgatga tgggcagcgc 
cgcggagccc aactgcgccg accccgccac 
ggagggcttc ctggacacgc tggtggtgct 
cgatgcctgg ggccgtctgc ccgtggacct 
acggtacctg cgcgcggctg cggggggcac 
cgcggaaggt ccctcagaca tccccgattg 
ggaaacttag atcatcagtc accgaaggtc 
ccaccccgct ttcgtagttt tcatttagaa 
aacgtagata taagccttcc cccactaccg 
ttcttataaa aatgtaaaaa agaaaaacac 
tctggagtga gcactcacgc cctaagcgca 
cagcctccgg aagctgtcga cttcatgaca 
ggttactggc ttctcttgag tcacactgct 
aataaaataa ttttcattca ttcactc 



ggggagcagc atggagccgg cggcggggag 60 
cacggccgcg gcccggggtc gggtagagga 120 
gcccaacgca ccgaatagtt acggtcggag 180 
ccgagtggcg gagctgctgc tgctccacgg 240 
tctcacccga cccgtgcacg acgctgcccg 300 
gcaccgggcc ggggcgcggc tggacgtgcg 360 
ggctgaggag ctgggccatc gcgatgtcgc 420 
cagaggcagt aaccatgccc gcatagatgc 480 
aaagaaccag agaggctctg agaaacctcg 540 
ctacagggcc acaactgccc ccgccacaac 600 
aatagagctt ttaaaaatgt cctgcctttt 660 
taaatgtcca tttatatcat tttttatata 720 
cgcttctgcc ttttcactgt gttggagttt 780 
cattcatgtg ggcatttctt gcgagcctcg 840 
agcattttgt gaactaggga agctcagggg 900 
agcaaatggc agaaccaaag ctcaaataaa 960 

987 



<210> 10 

<211> 1181 

<212> DNA 

<213> Homo sapiens 

<400> 10 

atggaggagc cgcagtcaga tcctagcgtc 
gacctatgga aactacttcc tgaaaacaac 
gatgatttga tgctgtcccc ggacgatatt 
gatgaagctc ccagaatgcc agaggctgct 
acaccggcgg cccctgcacc agccccctcc 
aaaacctacc agggcagcta cggtttccgt 
tctgtgactt gcacgtactc ccctgccctc 
tgccctgtgc agctgtgggt tgattccaca 
gccatctaca agcagtcaca gcacatgacg 
cgctgctcag atagcgatgg tctggcccct 
ttgcgtgtgg agtatttgga tgacagaaac 
gagccgcctg aggttggctc tgactgtacc 
tcctgcatgg gcggcatgaa ccggaggccc 
agtggtaatc tactgggacg gaacagcttt 
gaccggcgca cagaggaaga gaatctccgc 
ccagggagca ctaagcgagc actgcccaac 
aaaccactgg atggagaata tttcaccctt 



gagccccctc tgagtcagga aacattttca 60 
gttctgtccc ccttgccgtc ccaagcaatg 120 
gaacaatggt tcactgaaga cccaggtcca 180 
ccccccgtgg cccctgcacc agcagctcct 240 
tggcccctgt catcttctgt cccttcccag 300 
ctgggcttct tgcattctgg gacagccaag 360 
aacaagatgt tttgccaact ggccaagacc 420 
cccccgcccg gcacccgcgt ccgcgccatg 480 
gaggttgtga ggcgctgccc ccaccatgag 540 
cctcagcatc ttatccgagt ggaaggaaat 600 
acttttcgac atagtgtggt ggtgccctat 660 
accatccact acaactacat gtgtaacagt 720 
atcctcacca tcatcacact ggaagactcc 780 
gaggtgcgtg tttgtgcctg tcctgggaga 840 
aagaaagggg agcctcacca cgagctgccc 900 
aacaccagct cctctcccca gccaaagaag 960 
cagatccgtg ggcgtgagcg cttcgagatg 1020 



ttccgagagc tgaatgaggc cttggaactc aaggatgccc aggctgggaa ggagccaggg 1080 
gggagcaggg ctcactccag ccacctgaag tccaaaaagg gtcagtctac ctcccgccat 1140 
aaaaaactca tgttcaagac agaagggcct gactcagact g 1181 



<210> 11 
<211> 540 
<212> DNA 

<213> Homo sapiens 
<400> 11 

cgcgcctgcg gggcggagat gggcaggggg cggtgcgtgg gtcccagtct gcagttaagg 60 
gggcaggagt ggcgctgctc acctctggtg ccaaagggcg gcgcagcggc tgccgagctc 120 
ggccctggag gcggcgagaa catggtgcgc aggttcttgg tgaccctccg gattcggcgc 180 
gcgtgcggcc cgccgcgagt gagggttttc gtggttcaca tcccgcggct cacgggggag 240 
tgggcagcgc caggggcgcc cgccgctgtg gccctcgtgc tgatgctact gaggagccag 300 
cgtctagggc agcagccgct tcctagaaga ccaggtcatg atgatgggca gcgcccgagt 360 
ggcggagctg ctgctgctcc acggcgcgga gcccaactgc gccgaccccg ccactctcac 420 
ccgacccgtg cacgacgctg cccgggaggg cttcctggac acgctggtgg tgctgcaccg 480 
ggccggggcg cggctggacg tgcgcgatgc ctggggccgt ctgcccgtgg acctggctga 540 



<210> 12 
<211> 26 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: internalizing 
peptide 

<400> 12 

Cys Phe He Thr Lys Ala Leu Gly He Ser Tyr Gly Arg Lys Lys Arg 
15 10 15 

Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser 
20 25 



<210> 13 
<211> 12 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: EGF derived 
peptide 

<400> 13 

Cys Met His He Glu Ser Leu Asp Ser Tyr Thr Cys 
1 5 10 



<210> 14 
<211> 12 
<212> PRT 

<213> Unknown Organism 



<220> 

<223> Description of Unknown Organism: EGF derived 
peptide 

<400> 14 

Cys Met Tyr lie Glu Ala Leu Asp Lys Tyr Ala Cys 
1 5 . 10 



<210> 15 
<211> 32 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: internalizing 
peptide 

<220> 

<221> MOD_RES 
<222> (1) 

<223> unique residue, such as cysteine or lysine, that 
facilitates chemical conjugation of the 
internalizing peptide to a target protein 
conjugate . 

<220> 

<221> MOD_RES 
<222> (2) . . (3) 

<223> amino acid residue selected to modulate the 
affinity of the internalizing peptide for 
different membranes. 

<400> 15 

Xaa Xaa Xaa Glu Ala Ala Leu Ala Glu Ala Leu Ala Glu Ala Leu Ala 
15 10 15 

Glu Ala Leu Ala Glu Ala Leu Ala Glu Ala Leu Glu Ala Leu Ala Ala 
20 25 30 

<210> 16 
<211> 8 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: peptide substrate 
<400> 16 

Gly Asn Ala Ala Ala Ala Arg Arg 
1 5 



<210> 17 
<211> 10 
<212> PRT 



<213> Unknown Organism 



<220> 

<223> Description of Unknown Organism: laminin derived 
peptide 

<400> 17 

Cys Asp Pro Gly Tyr lie Gly Ser Arg Cys 
15 10 



<210> 18 
<211> 75 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: Ndel-EcoRl 
fragment 

<400> 18 

catatgggtg gctgccgtgg cgatatgttc ggttgcggtg ctcctccaaa aaagaagaga 60 
aaggtagctg gattc 75 

<210> 19 
<211> 24 
<212> PRT 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: RGD/SV40 peptide 
<400> 19 

Met Gly Gly Cys Arg Gly Asp Met Phe Gly Cys Gly Ala Pro Pro Lys 
15 10 15 

Lys Lys Arg Lys Val Ala Gly Phe 
20 



<210> 20 
<211> 225 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: Ndel-EcoRl 
fragment 

<400> 20 

catatggagc cagtagatcc tagactagag ccctggaagc atccaggaag tcagcctaaa 60 
actgcttgta ccaattgcta ttgtaaaaag tgttgctttc attgccaagt ttgtttcata 120 
acaaaagccc ttggcatctc ctatggcagg aagaagcgga gacagcgacg aagacctcct 180 
caaggcagtc agactcatca agtttctcta agtaagcaag gattc 225 



<210> 21 
<211> 72 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: engineered HIV-1 tat 
<400> 21 

Met Glu Pro Val Asp Pro Arg Leu Glu Pro Trp Lys His Pro Gly Ser 
15 10 15 

Gin Pro Lys Thr Ala Cys Thr Asn Cys Tyr Cys Lys Lys Cys Cys Phe 
20 25 30 

His Cys Gin Val Cys Phe He Thr Lys Ala Leu Gly He Ser Tyr Gly 
35 40 45 

Arg Lys Lys Arg Arg Gin Arg Arg Arg Pro Pro Gin Gly Ser Gin Thr 
50 55 60 

His Gin Val Ser Leu Ser Lys Gin 
65 70 



<210> 22 
<211> 912 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: Ndel-EcoRl 
fragment 

<400> 22 

catatgacct ctcgccgctc cgtgaagtcg ggtccgcggg aggttccgcg cgatgagtac 60 

gaggatctgt actacacccc gtcttcaggt atggcgagtc ccgatagtcc gcctgacacc 120 

tcccgccgtg gcgccctaca gacacgctcg cgccagaggg gcgaggtccg tttcgtccag 180 

tacgacgagt cggattatgc cctctacggg ggctcgtcat ccgaagacga cgaacacccg 240 

gaggtccccc ggacgcggcg tcccgtttcc ggggcggttt tgtccggccc ggggcctgcg 300 

cgggcgcctc cgccacccgc tgggtccgga ggggccggac gcacacccac caccgccccc 360 

cgggcccccc gaacccagcg ggtggcgact aaggcccccg cggccccggc ggcggagacc 420 

acccgcggca ggaaatcggc ccagccagaa tccgccgcac tcccagacgc ccccgcgtcg 480 

acggcgccaa cccgatccaa gacacccgcg caggggctgg ccagaaagct gcactttagc 54 0 

accgcccccc caaaccccga cgcgccatgg accccccggg tggccggctt taacaagcgc 600 

gtcttctgcg ccgcggtcgg gcgcctggcg gccatgcatg cccggatggc ggcggtccag 660 

ctctgggaca tgtcgcgtcc gcgcacagac gaagacctca acgaactcct tggcatcacc 720 

accatccgcg tgacggtctg cgagggcaaa aacctgcttc agcgcgccaa cgagttggtg 780 

aatccagacg tggtgcagga cgtcgacgcg gccacggcga ctcgagggcg ttctgcggcg 840 

tcgcgcccca ccgagcgacc tcgagcccca gcccgctccg cttctcgccc cagacggccc 900 
gtcgaggaat tc 912 



<210> 23 
<211> 301 
<212> PRT 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: engineered HSV-1 VP22 



<400> 23 

Met Thr Ser Arg Arg Ser Val Lys Ser Gly Pro Arg Glu Val Pro Arg 
15 10 15 

Asp Glu Tyr Glu Asp Leu Tyr Tyr Thr Pro Ser Ser Gly Met Ala Ser 
20 25 30 

Pro Asp Ser Pro Pro Asp Thr Ser Arg Arg Gly Ala Leu Gin Thr Arg 
35 40 45 

Ser Arg Gin Arg Gly Glu Val Arg Phe Val Gin Tyr Asp Glu Ser Asp 
50 55 60 

Tyr Ala Leu Tyr Gly Gly Ser Ser Ser Glu Asp Asp Glu His Pro Glu 
65 70 75 80 

Val Pro Arg Thr Arg Arg Pro Val Ser Gly Ala Val Leu Ser Gly Pro 
85 90 95 

Gly Pro Ala Arg Ala Pro Pro Pro Pro Ala Gly Ser Gly Gly Ala Gly 
100 105 110 

Arg Thr Pro Thr Thr Ala Pro Arg Ala Pro Arg Thr Gly Arg Val Ala 
115 120 125 

Thr Lys Ala Pro Ala Ala Pro Ala Ala Glu Thr Thr Arg Gly Arg Lys 
130 135 140 

Ser Ala Gin Pro Glu Ser Ala Ala Leu Pro Asp Ala Pro Ala Ser Thr 
145 150 155 160 

Ala Pro Thr Arg Ser Lys Thr Pro Ala Gin Gly Leu Ala Arg Lys Leu 
165 170 175 

His Phe Ser Thr Ala Pro Pro Asn Pro Asp Ala Pro Trp Thr Pro Arg 
180 185 190 

Val Ala Gly Phe Asn Lys Arg Val Phe Cys Ala Ala Val Gly Arg Leu 
195 200 205 

Ala Ala Met His Ala Arg Met Ala Ala Val Gin Leu Trp Asp Met Ser 
210 215 220 

Arg Pro Arg Thr Asp Glu Asp Leu Asn Glu Leu Leu Gly He Thr Thr 
225 230 235 240 

He Arg Val Thr Val Cys Glu Gly Lys Asn Leu Leu Gin Arg Ala Asn 
245 250 255 



Glu Leu Val Asn Pro Asp Val Val Gin Asp Val Asp Ala Ala Thr Ala 
260 265 270 



Thr Arg Gly Arg Ser Ala Ala Ser Arg Pro Thr Glu Arg Pro Arg Ala 



275 



280 



285 



Pro Ala Arg Ser Ala Ser Arg Pro Arg Arg Pro Val Glu 
290 295 300 



<210> 24 
<211> 120 
<212> DNA 

<213> Unknown Organism 
<220> 

<223> Description of Unknown Organism: Ndel-EcoRl 
fragment 

<400> 24' 

catatggacg tcgacgcggc cacggcgact cgagggcgtt ctgcggcgtc gcgccccacc 60 
gagcgacctc gagccccagc ccgctccgct tctcgcccca gacggcccgt cgaggaattc 120 



<210> 25 
<211> 37 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: VP22 (C-terminal domain) 
<400> 25 

Met Asp Val Asp Ala Ala Thr Ala Thr Arg Gly Arg Ser Ala Ala Ser 
15 10 15 

Arg Pro Thr Glu Arg Pro Arg Ala Pro Ala Arg Ser Ala Ser Arg Pro 
20 25 30 

Arg Arg Pro Val Glu 
35 



<210> 26 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: illustrative, 
peptide 

<400> 26 
Cys Ala Ala Met 
1 



<210> 27 
<211> 4 
<212> PRT 

<213> Artificial Sequence 



# 



<220> 

<223> Description of Artificial Sequence: illustrative 
peptide 

<400> 27 
Cys Ala Ala Ser 
1 



<210> 28 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 28 

aatccgtcga gcagagtt 18 



<210> 29 
<211> 24 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 29 

gcgcggctaa ccctaaccct aacc 24 



<210> 30 
<211> 1031 
<212> PRT 

<213> Euplotes sp. 
<400> 30 

Met Glu Val Asp Val 
1 5 

Ala Leu Lys Thr Cys 
20 

Asn lie Gin Lys Val 
35 

Lys Asp Leu Glu Asp 
50 

Thr Pro Arg Asp Tyr 
65 

Glu Val Phe Ser Thr 
85 



Asp Asn Gin Ala Asp Asn 
10 

Glu Glu lie Lys Glu Ala 
25 

lie Arg Cys Arg Asn Gin 
40 

lie Lys lie Phe Ala Gin 
55 

Asn Glu Glu Asp Phe Lys 
70 75 

Gly Leu Met He Glu Leu 
90 



His Gly He His Ser 
15 

Lys Thr Leu Tyr Ser 
30 

Ser Gin Ser His Tyr 
45 

Thr Asn He Val Ala 
60 

Val He Ala Arg Lys 
80 

He Asp Lys Cys Leu 
95 



Val Glu Leu Leu Ser Ser Ser Asp Val Ser Asp Arg Gin Lys Leu Gin 
100 105 110 



Cys Phe Gly Phe Gin Leu Lys Gly Asn Gin Leu Ala Lys 
115 120 125 



Thr His Leu 



Leu Thr 
130 



Ala Leu Ser Thr 



Gin Lys Gin Tyr 
135 



Phe Phe Gin 
140 



Asp Glu Trp 



Asn Gin 
145 

Thr Lys 
Phe Cys 
Asp Lys 



Cys Ser 
210 



Val Arg 
Tyr Leu 



Gly Asn 
180 



Ala Met 

150 

He Phe 
165 

Asn Val 



Lys Gin 

195 ■ 



Lys Gly 
Thr Cys Lys Tyr 



He Gly Asn Glu 



Gin Arg Thr Ser 
170 

Phe Asp His Leu 
185 

Gly Ala Ala Asp 
200 

Asn Val Lys Asn 
215 



Leu Phe Arg 
155 

Glu Gly Thr 
Lys Val Asn 



Met Asn Glu 
205 

Glu Lys Asp 
220 



His Leu Tyr 
160 

Leu Val Gin 
175 

Asp Lys Phe 
190 

Pro Arg Cys 



His Phe Leu 



Asn Asn 
225 



He Asn 



Val Pro 
230 



Asn Asn Asn Asn 



Met Lys Ser 
235 



Arg Thr Arg 
240 



He Phe Tyr Cys 



Thr His 
245 



Phe Asn Arg Asn 
250 



Asn Gin Phe 



Phe Lys Lys 
255 



His Glu Phe Val Ser Asn Lys Asn Asn lie Ser Ala Met Asp Arg Ala 
260 265 270 



Gin Thr 



Leu Lys 
290 



He Phe Thr Asn He Phe Arg Phe 
275 280 



Asp Lys Val He 



Glu Lys He Ala 
295 



Asn Arg He 
285 

Tyr Met Leu 
300 



Arg Lys Lys 
Glu Lys Val 



Lys Asp 
305 



Phe Asn 



Phe Asn 
310 



Tyr Tyr Leu Thr 



Lys Ser Cys 
315 



Pro Leu Pro 
320 



Glu Asn Asn Arg 



Glu Arg Lys Gin Lys He 
325 330 



Glu Asn Leu 



lie Asn Lys 
335 



Thr Arg 
Thr Asp 



lie Leu 
370 



Glu Glu 
340 



Lys Ser 
Cys Val 
Pro Lys Asp Phe 



Asn Lys 
355 



Lys Tyr Tyr Glu 
345 

Thr Gin Phe lie 
360 

Leu Thr Gly Arg 
375 



Glu Leu Phe 



Asn Glu Phe 
365 

Asn Arg Lys 
380 



Ser Tyr Thr 
350 

Phe Tyr Asn 



Asn Phe Gin 



Lys Lys Val Lys Lys Tyr Val Glu Leu Asn Lys His Glu Leu He His 
385 390 395 400 



Lys Asn Leu Leu Leu Glu Lys lie Asn Thr Arg Glu lie Ser Trp Met 
405 410 415 

Gin Val Glu Thr Ser Ala Lys His Phe Tyr Tyr Phe Asp His Glu Asn 
420 425 430 

lie Tyr Val Leu Trp Lys Leu Leu Arg Trp lie Phe Glu Asp Leu Val 
435 440 445 

Val Ser Leu lie Arg Cys Phe Phe Tyr Val Thr Glu Gin Gin Lys Ser 
450 455 460 

Tyr Ser Lys Thr Tyr Tyr Tyr Arg Lys Asn lie Trp Asp Val lie Met 
465 470 475 480 

Lys Met Ser lie Ala Asp Leu Lys Lys Glu Thr Leu Ala Glu Val Gin 
485 490 495 

Glu Lys Glu Val Glu Glu Trp Lys Lys Ser Leu Gly Phe Ala Pro Gly 
500 505 510 

Lys Leu Arg Leu lie Pro Lys Lys Thr Thr Phe Arg Pro lie Met Thr 
515 520 525 

Phe Asn Lys Lys lie Val Asn Ser Asp Arg Lys Thr Thr Lys Leu Thr 
530 535 540 

Thr Asn Thr Lys Leu Leu Asn Ser His Leu Met Leu Lys Thr Leu Lys 
545 550 555 560 

Asn Arg Met Phe Lys Asp Pro Phe Gly Phe Ala Val Phe Asn Tyr Asp 
565 570 575 

Asp Val Met Lys Lys Tyr Glu Glu Phe Val Cys Lys Trp Lys Gin Val 
580 585 590 

Gly Gin Pro Lys Leu Phe Phe Ala Thr Met Asp lie Glu Lys Cys Tyr 
595 600 605 

Asp Ser Val Asn Arg Glu Lys Leu Ser Thr Phe Leu Lys Thr Thr Lys 
610 615 620 

Leu Leu Ser Ser Asp Phe Trp lie Met Thr Ala Gin lie Leu Lys Arg 
625 630 635 640 

Lys Asn Asn lie Val lie Asp Ser Lys Asn Phe Arg Lys Lys Glu Met 
645 650 655 

Lys Asp Tyr Phe Arg Gin Lys Phe Gin Lys lie Ala Leu Glu Gly Gly 
660 665 670 



Gin Tyr Pro Thr Leu Phe Ser Val Leu Glu Asn Glu Gin Asn Asp Leu 
675 680 685 



Asn Ala Lys Lys Thr Leu lie Val Glu Ala Lys Gin Arg Asn Tyr Phe 
690 695 700 



Lys Lys Asp Asn Leu Leu Gin Pro Val He Asn He Cys Gin Tyr Asn 
705 710 715 720 

Tyr He Asn Phe Asn Gly Lys Phe Tyr Lys Gin Thr Lys Gly He Pro 
725 730 735 

Gin Gly Leu Cys Val Ser Ser He Leu Ser Ser Phe Tyr Tyr Ala Thr 
740 745 750 

Leu Glu Glu Ser Ser Leu Gly Phe Leu Arg Asp Glu Ser Met Asn Pro 
755 760 765 

Glu Asn Pro Asn Val Asn Leu Leu Met Arg Leu Thr Asp Asp Tyr Leu 
770 775 780 

Leu He Thr Thr Gin Glu Asn Asn Ala Val Leu Phe He Glu Lys Leu 
785 790 795 800 

He Asn Val Ser Arg Glu Asn Gly Phe Lys Phe Asn Met Lys Lys Leu 
805 810 815 

Gin Thr Ser Phe Pro Leu Ser Pro Ser Lys Phe Ala Lys Tyr Gly Met 
820 825 830 

Asp Ser Val Glu Glu Gin Asn He Val Gin Asp Tyr Cys Asp Trp He 
835 840 845 

Gly He Ser He Asp Met Lys Thr Leu Ala Leu Met Pro Asn He Asn 
850 855 860 

Leu Arg He Glu Gly He Leu Cys Thr Leu Asn Leu Asn Met Gin Thr 
865 870 875 880 

Lys Lys Ala Ser Met Trp Leu Lys Lys Lys Leu Lys Ser Phe Leu Met 
885 890 895 

Asn Asn He Thr His Tyr Phe Arg Lys Thr He Thr Thr Glu Asp Phe 
900 905 910 

Ala Asn Lys Thr Leu Asn Lys Leu Phe He Ser Gly Gly Tyr Lys Tyr 
915 920 925 

Met Gin Cys Ala Lys Glu Tyr Lys Asp His Phe Lys Lys Asn Leu Ala 
930 935 940 

Met Ser Ser Met He Asp Leu Glu Val Ser Lys He He Tyr Ser Val 
945 950 955 960 

Thr Arg Ala Phe Phe Lys Tyr Leu Val Cys Asn He Lys Asp Thr He 
965 970 975 



Phe Gly Glu Glu His Tyr Pro Asp Phe Phe Leu Ser Thr Leu Lys His 
980 985 990 



Phe He Glu He Phe Ser Thr Lys Lys Tyr He Phe Asn Arg Val Cys 
995 1000 1005 



Met lie Leu Lys Ala Lys Glu Ala Lys Leu Lys Ser Asp Gin Cys Gin 
1010 1015 1020 



Ser Leu lie Gin Tyr Asp Ala 
1025 1030 



<210> 31 
<211> 884 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 31 

Met Lys lie Leu Phe Glu Phe lie Gin Asp Lys Leu Asp lie Asp Leu 
1 5 10 15 

Gin Thr Asn Ser Thr Tyr Lys Glu Asn Leu Lys Cys Gly His Phe Asn 
20 25 30 

Gly Leu Asp Glu lie Leu Thr Thr Cys Phe Ala Leu Pro Asn Ser Arg 
35 40 45 

Lys lie Ala Leu Pro Cys Leu Pro Gly Asp Leu Ser His Lys Ala Val 
50 55 60 

lie Asp His Cys lie lie Tyr Leu Leu Thr Gly Glu Leu Tyr Asn Asn 
65 70 75 80 

Val Leu Thr Phe Gly Tyr Lys lie Ala Arg Asn Glu Asp Val Asn Asn 
85 90 95 

Ser Leu Phe Cys His Ser Ala Asn Val Asn Val Thr Leu Leu Lys Gly 
100 105 110 

Ala Ala Trp Lys Met Phe His Ser Leu Val Gly Thr Tyr Ala Phe Val 
115 120 125 

Asp Leu Leu lie Asn Tyr Thr Val lie Gin Phe Asn Gly Gin Phe Phe 
130 135 140 

Thr Gin lie Val Gly Asn Arg Cys Asn Glu Pro His Leu Pro Pro Lys 
145 150 155 160 

Trp Val Gin Arg Ser Ser Ser Ser Ser Ala Thr Ala Ala Gin lie Lys 
165 170 175 

Trp Leu Thr Glu Pro Val Thr Asn Lys Gin Phe Leu His Lys Leu Asn 
180 185 190 

lie Asn Ser Ser Ser Phe Phe Pro Tyr Ser Lys lie Leu Pro Ser Ser 
195 200 205 

Ser Ser lie Lys Lys Leu Thr Asp Leu Arg Glu Ala lie Phe Pro Thr 
210 215 220 



Asn Leu Val Lys lie Pro Gin Arg Leu Lys Val Arg lie Asn Leu Thr 



o 



225 



230 



235 



240 



Leu Gin Lys Leu Leu Lys Arg His Lys Arg Leu Asn Tyr Val Ser lie 
245 250 255 

Leu Asn Ser lie Cys Pro Pro Leu Glu Gly Thr Val Leu Asp Leu Ser 
260 265 270 

His Leu Ser Arg Gin Ser Pro Lys Glu Arg Val Leu Lys Phe He He 
275 ■ 280 285 

Val He Leu Gin Lys Leu Leu Pro Gin Glu Met Phe Gly Ser Lys Lys 
290 295 300 

Asn Lys Gly Lys He He Lys Asn Leu Asn Leu Leu Leu Ser Leu Pro 
305 310 315 320 

Leu Asn Gly Tyr Leu Pro Phe Asp Ser Leu Leu Lys Lys Leu Arg Leu 
325 330 335 

Lys Asp Phe Arg Trp Leu Phe He Ser Asp lie Trp Phe Thr Lys His 
340 345 350 

Asn Phe Glu Asn Leu Asn Gin Leu Ala He Cys Phe He Ser Trp Leu 
355 360 365 

Phe Arg Gin Leu He Pro Lys He lie Gin Thr Phe Phe Tyr Cys Thr 
370 375 380 

Glu lie Ser Ser Thr Val Thr He Val Tyr Phe Arg His Asp Thr Trp 
385 390 395 400 

Asn Lys Leu He Thr Pro Phe He Val Glu Tyr Phe Lys Thr Tyr Leu 
405 410 415 

Val Glu Asn Asn Val Cys Arg Asn His Asn Ser Tyr Thr Leu Ser Asn 
420 425 430 

Phe Asn His Ser Lys Met Arg He He Pro Lys Lys Ser Asn Asn Glu 
435 440 445 

Phe Arg He He Ala He Pro Cys Arg Gly Ala Asp Glu Glu Glu Phe 
450 455 460 

Thr He Tyr Lys Glu Asn His Lys Asn Ala He Gin Pro Thr Gin .Lys 
465 470 475 480 

He Leu Glu Tyr Leu Arg Asn Lys Arg Pro Thr Ser Phe Thr Lys He 
485 490 495 

Tyr Ser Pro Thr Gin He Ala Asp Arg He Lys Glu Phe Lys Gin Arg 
500 * 505 510 

Leu Leu Lys Lys Phe Asn Asn Val Leu Pro Glu Leu Tyr Phe Met Lys 
515 520 525 



Phe Asp Val Lys Ser Cys Tyr Asp Ser lie Pro Arg Met Glu Cys Met 



530 



535 



540 



Arg lie Leu Lys Asp Ala Leu Lys Asn Glu Asn Gly Phe Phe Val Arg 
545 550 555 560 

Ser Gin Tyr Phe Phe Asn Thr Asn Thr Gly Val Leu Lys Leu Phe Asn 
565 570 575 

Val Val Asn Ala Ser Arg Val Pro Lys Pro Tyr Glu Leu Tyr lie Asp 
580 585 590 

Asn Val Arg Thr Val His Leu Ser Asn Gin Asp Val lie Asn Val Val 
595 600 605 

Glu Met Glu He Phe Lys Thr Ala Leu Trp Val Glu Asp Lys Cys Tyr 
610 615 620 

He Arg Glu Asp Gly Leu Phe Gin Gly Ser Ser Leu Ser Ala Pro He 
625 630 635 640 

Val Asp Leu Val Tyr Asp Asp Leu Leu Glu Phe Tyr Ser Glu Phe Lys 
645 650 655 

Ala Ser Pro Ser Gin Asp Thr Leu He Leu Lys Leu Ala Asp Asp Phe 
660 665 670 

Leu He He Ser Thr Asp Gin Gin Gin Val He Asn He Lys Lys Leu 
675 680 685 

Ala Met Gly Gly Phe Gin Lys Tyr Asn Ala Lys Ala Asn Arg Asp Lys 
690 695 700 

He Leu Ala Val Ser Ser Gin Ser Asp Asp Asp Thr Val He Gin Phe 
705 710 715 720 

Cys Ala Met His He Phe Val Lys Glu Leu Glu Val Trp Lys His Ser 
725 730 735 

Ser Thr Met Asn Asn Phe His He Arg Ser Lys Ser Ser Lys Gly He 
740 745 750 

Phe Arg Ser Leu He Ala Leu Phe Asn Thr Arg He Ser Tyr Lys Thr 
755 760 765 

lie Asp Thr Asn Leu Asn Ser Thr Asn Thr Val Leu Met Gin He Asp 
770 775 780 

His Val Val Lys Asn He Ser Glu Cys Tyr Lys Ser Ala Phe Lys Asp 
785 790 795 800 

Leu Ser He Asn Val Thr Gin Asn Met Gin Phe His Ser Phe Leu Gin 
805 810 815 

Arg lie lie Glu Met Thr Val Ser Gly Cys Pro lie Thr Lys Cys Asp 
820 825 830 



Pro Leu lie Glu Tyr Glu Val Arg Phe Thr He Leu Asn Gly Phe Leu 



835 840 845 

Glu Ser Leu Ser Ser Asn Thr Ser Lys Phe Lys Asp Asn lie lie Leu 
850 855 860 

Leu Arg Lys Glu He Gin His Leu Gin Ala Tyr He Tyr He Tyr He 
865 870 875 880 

His He Val Asn 



<210> 32 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 32 

Val Arg Arg Val Glu He Pro Lys 
1 5 



<210> 33 
<211> 12 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 33 

Phe Arg Pro Leu Ser Val Gly Asn Pro Arg Glu Lys 
15 10 



<210> 34 
<211> 17 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 34 

Tyr Cys Asn Trp Phe lie Lys Val Asp Leu Asn Lys Cys Phe Asp Thr 
15 10 15 

He 



<210> 35 
<211> 24 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 35 

Asn Tyr His Asn Thr Thr Leu Gly He Pro Gin Gly Ser Val Val Ser 
15 10 15 

Pro lie Leu Cys Asn He Phe Leu 
20 



• 



<210> 36 
<211> 13 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 36 

Tyr Phe Val Arg Tyr Ala Asp Asp He He He Gly Val 
15 10 



<210> 37 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 37 

Gly Met Ser He Asn He Asp Lys Ser Val He 
15 10 



<210> 38 
<211> 8 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 38 

Met Arg He Val Asn He Pro Lys 
1 5 



<210> 39 
<211> 12 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 39 

He Arg Pro Leu Ser Val Gly Asn Pro Arg Asp Lys 
15 10 



<210> 40 
<211> 17 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 40 

Gly Ser Asn Trp Phe Arg Glu Val Asp Leu Lys Lys Cys Phe Asp Thr 
15 10 15 

He 



<210> 41 
<211> 25 
<212> PRT 



<213> Saccharomyces cerevisiae 



<400> 41 

Thr Tyr His Lys Pro lie Leu Gly Leu Pro Gin Gly Ser Leu lie Pro 
1 5 10 . 15 

Ser Pro lie Leu Cys Asn lie Val lie 
20 25 



<210> 42 
<211> 13 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 42 

Lys Tyr Val Arg Tyr Ala Asp Asp lie Leu lie Gly Val 
15 10 



<210> 43 
<211> 11 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 43 

Gly Leu Thr lie Asn Glu Glu Lys Thr Leu lie 
15 10 



